NSF PAR Search | NSF Public Access Repository


Skipper: Enabling efficient SNN training through activation-checkpointing and time-skipping

https://doi.org/10.1109/MICRO56248.2022.00047

Singh, Sonali; Sarma, Anup; Lu, Sen; Sengupta, Abhronil; Kandemir, Mahmut T.; Neftci, Emre; Narayanan, Vijaykrishnan; Das, Chita R. (October 2022, 55th IEEE/ACM International Symposium on Microarchitecture (MICRO))

Full Text Available
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training

Sarma, Anup; Singh, Sonali; Jiang, Huaipan; Zhang, Rui; Kandemir, Mahmut; Das, Chita. (January 2022, Advances in neural information processing systems)

Full Text Available
Skipper: Enabling efficient SNN training through activation-checkpointing and time-skipping

Singh, Sonali; Sarma, Anup; Lu, Sen; Sengupta, Abhronil; Kandemir, Mahmut T.; Neftci, Emre; Narayanan, Vijaykrishnan; Das, Chita R. (January 2022, Proceedings of the annual International Symposium on Microarchitecture)

Full Text Available
Gesture-SNN: Co-optimizing accuracy, latency and energy of SNNs for neuromorphic vision sensors

https://doi.org/10.1109/ISLPED52811.2021.9502506

Singh, Sonali; Sarma, Anup; Lu, Sen; Sengupta, Abhronil; Narayanan, Vijaykrishnan; Das, Chita R. (August 2021, 2021 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED))

Full Text Available
Exploiting Activation based Gradient Output Sparsity to Accelerate Backpropagation in CNNs

Sarma, Anup; Singh, Sonali; Jiang, Huaipan; Pattnaik, Ashutosh; Mishra, Asit K.; Narayanan, Vijaykrishnan; Kandemir, Mahmut T.; Das, Chita R. (September 2021, arXiv.org)

Full Text Available
NEBULA: A Neuromorphic Spin-Based Ultra-Low Power Architecture for SNNs and ANNs

https://doi.org/10.1109/ISCA45697.2020.00039

Singh, Sonali; Sarma, Anup; Jao, Nicholas; Pattnaik, Ashutosh; Lu, Sen; Yang, Kezhou; Sengupta, Abhronil; Narayanan, Vijaykrishnan; Das, Chita R. (May 2020, 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA))
Brain-inspired cognitive computing has so far followed two major approaches: one uses multi-layered artificial neural networks (ANNs) to perform pattern-recognition-related tasks, whereas the other uses spiking neural networks (SNNs) to emulate biological neurons in an attempt to be as efficient and fault-tolerant as the brain. While there has been considerable progress in the former area due to a combination of effective training algorithms and acceleration platforms, the latter is still in its infancy due to the lack of both. SNNs have a distinct advantage over their ANN counterparts in that they are capable of operating in an event-driven manner, thus consuming very low power. Several recent efforts have proposed various SNN hardware design alternatives; however, these designs still incur considerable energy overheads. In this context, this paper proposes a comprehensive design spanning the device, circuit, architecture and algorithm levels to build an ultra-low-power architecture for SNN and ANN inference. For this, we use spintronics-based magnetic tunnel junction (MTJ) devices that have been shown to function as both neuro-synaptic crossbars and thresholding neurons and can operate at ultra-low voltage and current levels. Using this MTJ-based neuron model and synaptic connections, we design a low-power chip that has the flexibility to be deployed for inference of SNNs, ANNs, and SNN-ANN hybrid networks, a distinct advantage compared to prior works. We demonstrate the competitive performance and energy efficiency of the SNNs as well as hybrid models on a suite of workloads. Our evaluations show that the proposed design, NEBULA, is up to 7.9× more energy-efficient than a state-of-the-art design, ISAAC, in the ANN mode. In the SNN mode, our design is about 45× more energy-efficient than a contemporary SNN architecture, INXS. Power comparison between NEBULA ANN and SNN modes indicates that the latter is at least 6.25× more power-efficient for the observed benchmarks.
Full Text Available
CASH: compiler assisted hardware design for improving DRAM energy efficiency in CNN inference

https://doi.org/10.1145/3357526.3357536

Sarma, Anup; Jiang, Huaipan; Pattnaik, Ashutosh; Kotra, Jagadish; Kandemir, Mahmut Taylan; Das, Chita R. (January 2019, International Symposium on Memory Systems)

The advent of machine learning (ML) and deep learning applications has led to the development of a multitude of hardware accelerators and architectural optimization techniques for parallel architectures. This is due in part to the regularity and parallelism exhibited by ML workloads, especially convolutional neural networks (CNNs). However, CPUs continue to be one of the dominant compute fabrics in datacenters today, and are therefore also widely deployed for inference tasks. As CNNs grow larger, the inherent limitations of a CPU-based system become apparent, specifically in terms of main memory data movement. In this paper, we present CASH, a compiler-assisted hardware solution that eliminates redundant data movement to and from the main memory and, therefore, reduces main memory bandwidth and energy consumption. Our experimental evaluations on a set of four different state-of-the-art CNN workloads indicate that CASH provides, on average, ~40% and ~18% reductions in main memory bandwidth and energy consumption, respectively.
Full Text Available
